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ABSTRACT 

We present the first application of lens magnification to measure the absolute mass of a galaxy cluster; Abell 1689. The 
absolute mass of a galaxy cluster can be measured by the gravitational lens magnification of a background galaxy popu- 
lation by the cluster gravitational potential. The lensing signal is complicated by the intrinsic variation in number counts 
due to galaxy clustering and shot-noise, and by additional uncertainties in relating magnification to mass in the strong 
lensing regime. Clustering and shot-noise can be dealt with using maximum likelihood methods. Local approximations 
can then be used to estimate the mass from magnification. Alternatively if the lens is axially symmetric we show that 
the amplification equation can be solved nonlocally for the surface mass density and the tangential shear. 

In this paper we present the first maps of the total mass distribution in Abell 1689, measured from the deficit of 
lensed red galaxies behind the cluster. Although noisier, these reproduce the main features of mass maps made using the 
shear distortion of background galaxies, but have the correct normalisation, finally breaking the "sheet-mass" degeneracy 
that has plagued lensing methods based on shear. Averaging over annular bins centered on the peak of the light distribution 
we derive the cluster mass profile in the inner 4' (0.48/i _1 Mpc). These show a profile with a near isothermal surface mass 
density k ~ (0.5 ± 0.1)(#/1') _1 out to a radius of 2.4' (0.28/i _1 Mpc), followed by a sudden drop into noise. We find that 
the projected mass interior to 0.24/i _1 Mpc is M(< 0.24/i _1 Mpc) = (0.50 ± 0.09) x IO^/i^Mq. 

We compare our results with masses estimated from X-ray temperatures and line-of-sight velocity dispersions, as 
well as weak shear and lensing arclets. We find that the masses inferred from X-ray, line-of-sight velocity dispersions, 
arclets and weak shear are all in fair agreement for Abell 1698. 



Subject Headings : Cosmology: gravitational lensing, clusters, dark matter 



1 INTRODUCTION 

The magnitude and distribution of matter in galaxy clusters 
should in principle provide a strong constraint on cosmologi- 
cal models of structure formation and the mean mass density 
of the Universe. In addition a direct image of the mass den- 
sity will tell us much about the relationship between gas, 
galaxies and dark matter and whether light is indeed a fair 
- if biased - tracer of mass. 

Early techniques for estimating the mass in clusters in- 
clude dynamical methods, from the line-of-sight velocity 
dispersion of member galaxies, and X-ray temperature mea- 
surements. However these estimates make some strong as- 
sumptions about equilibrium conditions in the cluster. 



Kaiser and Squires (1993) circumvented this by showing 
that a more direct method of estimating the mass, with no 
underlying assumptions about the dynamical or thermody- 
namical state of the cluster, was to measure the shear field 
in the source distribution of the cluster background (Kaiser 
& Squires 1993; Tyson, Valdes & Wenk 1990, Schneider & 
Seitz 1995 ). On average the shear pattern of a population 
of unlensed galaxies should be randomly distributed. But in 
the presence of a massive gravitational lensing cluster, the 
shear field is polarized. Since the shear field is related (non- 
locally) to the surface mass density the shear can be used 
to estimate the mass distribution - up to an arbitrary con- 
stant. The presence of this arbitrary constant, referred to 
as the "sheet-mass" degeneracy, means that only differen- 
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tiaJ masses can be measured. Shear maps are conventionally 
normalised to the edge of the observed field, or such that the 
inferred mass-density is everywhere positive, and so repre- 
sent a lower limit on the mass. 

Soon after, Broadhurst, Taylor & Peacock (1995; here- 
after BTP) showed that the sheet-mass degeneracy could be 
broken by use of the gravitational lens magnification effect. 
The number and magnitude-redshift distribution of back- 
ground galaxies is distorted by the gravitational field of the 
lensing cluster and in the weak lensing regime this distor- 
tion provides a straightforward estimate of the surface mass 
density. With calibration from off-set fields the cluster mass 
distribution can be properly normalized. 

BTP also suggested that a degraded, but much quicker, 
estimate of the magnification effect could be made from the 
distortion of angular number counts of background sources. 
Broadhurst (1995) found evidence for this distortion in the 
background counts of the cluster Abell 1689, as did Fort, 
Mellier & Dantel-Fort (1997) for C10024. In this work we ap- 
ply the methods developed by BTP and extended by Taylor 
& Dye (1998) to estimate the surface mass density from the 
distortion of angular counts, including the effects of shot- 
noise and galaxy clustering, and van Kampen (1998) on esti- 
mating the surface mass density in the strong lensing regime, 
to Abell 1689. 

The layout of the paper is as follows. In Section 2 we 
describe the magnification effect itself. In Section 3 we de- 
scribe the effects of shot noise and clustering on estimates of 
the surface mass density. In Section 4 we describe how to es- 
timate the surface mass density in the strong lensing regime 
using a local approximations, and introduce a new, self- 
consistent, nonlocal solution for axially symmetric lenses. 
We apply these methods to map out the mass in the cluster 
Abell 1689 in Section 5 and find its profile. Our mass esti- 
mate is compared to other estimates in Section 6 and our 
conclusions are presented in Section 7. 



2 THE MAGNIFICATION EFFECT 

The observed number of galaxies seen in projection on the 
sky is (BTP, Taylor & Dye 1998) 



n' = n A l3 -\l + Q), 



(1) 



where no is the expected mean number of galaxies in a given 
area at a given magnitude. Variations in this mean arise 
from the angular perturbation in galaxy density, O, due to 
galaxy clustering, and from gravitational lens magnification. 
The lens amplification factor is 
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where 



(2) 



(3) 



is the surface mass density in units of the critical surface 
mass, E cr it. The amplitude of the shear field is given by 
7 and the background galaxy luminosity function is locally 
approximated by 



The amplification index, /3 — 1, accounts for the expansion 
of the background image and for the increase in number as 
faint sources are lensed above the flux limit. 

In the absence of galaxy clustering and finite sampling 
effects the background galaxy distribution can simply be 
inverted, via equation (hi), to find the amplification. One 
can then solve equation (El) to find the surface density, with 
some realistic assumptions about the shear. In Section 4 we 
discuss various approximations that allow us to do this. 

However, given a small resolution scale for the surface 
amplification, galaxy clustering and finite sampling will in 
general be an important effect. In the next Section we discuss 
the effects of intrinsic variation in the distribution of the 
background galaxy sources. 



3 GALAXY CLUSTERING NOISE 

The main sources of uncertainty in lens magnification are 
due to shot-noise, from finite sampling, and the intrinsic 
clustering of the background source population which in- 
troduce correlated fluctuations in the angular counts. As 
we are viewing small-angles the clustering properties of the 
background source galaxies are not in general linear, unless 
the depth of background is sufficient to wash out the clus- 
tering pattern. As a result it is not sufficient to make the 
usual assumption that galaxy clustering can be modelled by 
a Gaussian distribution. 

We can account for the effects of shot-noise and non- 
linear clustering by modeling the angular counts by a 
Lognormal-Poisson model (Coles & Jones 1991, BTP, Tay- 
lor & Dye 1998) - a random point-process sampling of a 
lognormal density field. The distribution function of source 
counts is then 
1 



P(n) 



dx 
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where A = Xoe x is the local mean density, i is a Gaus- 
sian random variable of zero mean and variance a 2 and 
Ao = noA f3 ~ 1 e~' 7 ^ 2 correctly normalises the counts. The 
linear clustering variance, a 2 , is related to the nonlinear 
variance by a 2 — ln[l + a 2 ^. We have tested this distri- 
bution against available data and find that it is an excellent 
fit to the distribution of counts in the deep fields. The only 
parameters are the observed count per pixel, n, and the vari- 
ance of the lognormal field. The amplitude of clustering of 
the density field and its dependence of redshift can be esti- 
mated from, e.g., the I-band selected galaxies in the Canada- 
France Redshift Surve y in th e range 17.5 < I < 22.5 (Le 
Fevre 1996; see section 5.2.3). The quantity required is the 
variance in a given area of sky, which can be estimated by 
averaging the observed angular correlation function, u>(6), 
over a given area: 



fi(0) 



d 2 e'u>(9') 



(7) 



n(L) 



(4) 



where Q(0) is the area. 

Our method of approach is then that discussed by BTP. 
At each pixel in a map of the source counts one uses the 
distribution equ. Jra as a likelihood function, C(A\n,a) = 
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P(n\cr, A), assuming a uniform prior for the amplification. 
The surface density is then found from the amplification 
by making some realistic assumption about the shear and 
maximizing the likelihood. In the next Section we discuss a 
number of ways of transforming from the amplification to k 
in the strong lensing regime. 



4 THE STRONG LENSING REGIME 

Transforming from amplification to the surface mass density 
is potentially non-trivial, as we have no shear information. 
One could incorporate this from independent measurements 
of the shear field, but for the present discussion we are in- 
terested in developing a completely independent lensing ap- 
proach. We shall discuss combining shear and magnification 
elsewhere. In principle one could generate a first guess for 
the surface mass density and iterate the amplification equa- 
tion towards a solution of both surface density and shear. 
However, given the small field of view and uncertainties in- 
troduced by parity changes, this can be an unstable prob- 
lem. In addition, as the solutions are in general multivalued, 
we would hope to start from as near to the correct solution 
as possible. In this section we discuss a number of reason- 
able approximations to solve the amplification equation (Q). 
These can be regarded as solutions in their own right, or 
as the first best guess to an iterated solution. We begin by 
discussing the local approximation methods suggested and 
fully tested on simulated clusters by van Kampen (1998). 
Then in Section 4.2 we present a new, self-consistent solu- 
tion to the amplification equation, for k and 7 for an axially 
symmetric lens. 

4.1 Local approximations to the surface mass 
density 

There exist only two local relations between 7 and k that re- 
sult in a single caustic solution of the amplification equation 
(2) which is easily invertible (van Kampen 1998) : 7 = 0, cor- 
responding to a sheet of matter, and 7 = k, for an isotropic 
lens. These two relations have corresponding estimators for 
function of amplification: 

Ko = k( 7 = 0) = 1~TA~ 1/2 , (8) 

m = k{ 1 = k) = 1(1 -TA- 1 ), (9) 

where V = ±1 is the image parity. 

Let us assume that the surface mass density of the lens 
is smooth over some scale. In this case, for a sufficiently 
smooth lens 7 < k (BTP). The equality holds in the case 
of an isotropic lens, for instance the isothermal lens. The 
inequality holds for any anisotropic lens, with the sheet mass 
at the extreme. For a smooth lens these two estimates bound 
the true value k\ < k < no- Before caustic crossing it can 
also be shown that ki < ko < ftweak holds, where A = 
1 + 2K wca k is the weak lensing limit (BTP). Hence the weak 
lensing approximation will overestimate the cluster mass in 
the strong regime, usually by a factor of two (van Kampen 
1998). 

In practice, substructure and asphericity of the clus- 
ter will induce extra shear (e.g. Bartelmann, Steinmetz & 



Weiss 1995) , especially in the surrounding low-s neighbour- 
hood, where substructure is relatively more dominant and 
filaments make the cluster most aspherical. This means that 
the lens will not be smooth for small k, and therefore «i is 
a lower limit for the true k only for the central parts of the 
cluster, in the case the lens parity is known. Van Kampen 
(1998) found it to be a good lower limit only for k > 0.4 (for 
the most massive clusters), while for k < 0.2, ki is usually 
fairly close to the true value. For angle-averaged K-profiles 
ki is a good lower limit for (k)^ > 0.2. All this has no bear- 
ing on acq, which remains a strong upper limit until the first 
caustic crossing. 

An approximation that tries to take these cluster lens 
features into account, while still giving an invertible A(k) 
relation, is (van Kampen 1998): 

7=11-4 a/?. (10) 
V c 

which results in an amplification relation that admits the 
full four solutions: 

A" 1 = |(«- c)(«- l/c)|, (11) 
with caustics at ft = c and 1/c. The solution for k, is then 

Kc = h ( (c2 + x) ~ 5v/(c2 + 1)2 ~ 4c2(1 ~ VA ~^) ■ (12) 

Solutions are set by choosing the parities V,S = ±1 where V 
is the image parity and S is the sign of ((c 2 + l)/2c— k). Note 
that the sheet-like solution is recovered by setting c = 1. 

Figure 1 shows a plot of k versus the inverse amplifica- 
tion, A -1 , for the three estimators. Also shown is the weak 
field approximation. The points are taken from a simulated 
lensing cluster (van Kampen & Kargert 1997) which is of 
comparable size to A1689. It is clear that kq is a strong 
bound, at least until a caustic is crossed, and that Ki pro- 
vides a very good bound for k > 0.2. The weak field approx- 
imation however is extremely bad except in the very weak 
regime (« < 0.1). The two-caustic approximation behaves 
as it is designed to do: a good fit between the other two 
strong lensing estimators for the central parts of the cluster, 
while also modelling the 7 > k behaviour for small k. These 
results are fairly robust over a wide range of clusters and for 
all realistic values of the cosmological density parameter flo . 

4.2 A nonlocal approximation to the surface 
mass density 

An alternative approach is to assume axial symmetry for 
the lens. Because this fixes a non-local functional relation- 
ship between k and 7 (equation |l5| ), we can solve the am- 
plification equation (El) for a self-consistent k and 7 profile. 
Although we shall apply our results to circularly averaged 
data, these results hold for any self-similar embedded set of 
contours. 

We define a mean surface density interior to a contour 
by integration over the interior area, £1(6), 

-<v=W)I/ ' Kie,) (13) 

The deflection angle for the axi-symmetric lens is 

A6 = BR, (14) 
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Figure 1. Scatter plot of the surface mass density, k, versus 
the inverse amplification, A -1 , for a simulated cluster in a CDM 
universe (see van Kampen 1998 for details). The cluster is at a 
redshift of 0.183 and the background population at z = 0.8. The 
cluster was selected to look similar to Abell 1689. The solid line 
is the 7 = strong lensing approximation. Before caustic crossing 
this is a hard bound on the locus of points. The dashed line is the 
7 = k approximation, which is a good lower bound for k > 0.2 
for this cluster. The weak-lensing approximation (dotted— line) is 
seen to be a very bad approximation for k > 0.1. The dot-dashed 
line is a best fit to the simulation for the 2-caustic approximation 
7 = |1 — c| -v/k/c, with c = 0.7. 



and the shear is given by 
7 = 7t = \k-k\. 



(15) 



where the tangential term, 7t, is the only component of shear 
generated. The amplification factor is given by 



A' 1 = |(1-k)(1-2k + «) 



(16) 



One can now simultaneously solve for the surface mass 
density, shear and amplification by series solution. Firstly 
we divide the surface mass into consecutive shells with equal 
separation (any arbitrary separation can be used, we have 
chosen a regular separation for convenience). If we split R 
into an interior term, rj n -i, and a surface term, then for the 
n th shell we have 
2 

K n = t] n -\ + - — K„ (17) 

(n + 1) 
where we have defined 

2 

~i(n + l) 



(18) 



The surface mass density in the n th shell is then given by 

n n = n * (n+1 — (n — l)i]„-i — 
An K 

S\(n - 1 - (n + l)?7n-i) 2 + AnVA- 1 } 1 ! 2 ) , (19) 



where V, S = ±1 are again the image parities. The only free- 
dom we have, for a given amplification profile is the choice 
of the shear on the first shell, 71 — rjo, and the parity. It 
should be noted that given the amplification and having 
fixed the parities one has to ensure that the first 7 satis- 
fies 7 2 > "PA" 1 , to avoid unphysical solutions. The nonlocal 
approximation contains both the sheet and isothermal solu- 
tions as specific solutions. The uncertainty on k and 7 can 
be found by simple error propagation of the uncertainty on 
the measurement of the amplification. 

Having shown in Sections 2, 3 and 4 how, in principle, 
one can measure the surface mass density from angular num- 
ber counts, in the next section we exploit these methods to 
measure the mass distribution in the lensing cluster Abell 
1689. 



5 APPLICATION TO ABELL 1689 

In this section we apply the methods discussed in sections 
2, 3 and 4 to observational data. We begin by describing the 
data. 



5.1 The Data 

5.1.1 Data acquisition and reduction 

The data were obtained during a run in February 1994 at 
ESO's NTT 3.6m telescope, with 10 4 sees integration in the 
V and I bands, and covers 70 square arcminutes on the clus- 
ter. Seeing was similar in both bands, with FWHM of 0.8" 
and a CCD pixel scale of 0.34". The EMMI instrument was 
used throughout. The passbands and exposures were cho- 
sen such that the cluster E/S0 galaxies would be bluer than 
a good fraction of the background, requiring much deeper 
imaging in the bluer passband for detection. The cluster was 
observed down to a limiting magnitude of I = 24. 

The images were debiased and flattened with skyflats 
using standard IRAF procedures. After this there remained 
some large scale gradients of a few percent, probably caused 
by some rotation of the internal lens. We additionally cor- 
rected each separate exposure with a smoothed version of it- 
self, obtained after masking out the cluster and other bright 
objects. Following this we had homogeneous photometry 
across the field (A further discussion of the reduction pro- 
cedure can be found in Bem'tez, et al, 1998, in preparation). 
The zero point was found to be good to 0.1 magnitudes. 
High humidity on a few nights meant that some of the data 
was not photometric, so we calibrated with the photometric 
data. The object detection and classification was performed 
with SExtractor. 



5.1.2 Separation of cluster and background 

To measure the distortion in background counts we must 
first separate the background from cluster members and 
mask off the area they obscure. Cluster galaxies were iden- 
tified from the strong cluster E/SO colour sequence, which 
forms a horizontal band across the colour-magnitude dia- 
gram, shown in figure ^| The sharp upper edge of this band 
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Figure 2. Colour-magnitude diagram for A1689, overlaid with 
colour cuts used to isolate the cluster members from the back- 
ground population; 20 < I < 24, 1.6 < V-I < 3.5 and V < 26.8. 
The strong horizontal band of galaxies is the cluster E/SO se- 
quence. 



represents the reddest galaxies in the cluster. Galaxies red- 
der than this are cosmologically redshifted, and hence rep- 
resent a background population. As well as isolating cluster 
members, this selection should also ensure that any fore- 
ground galaxies are removed. Anything redder than V — I = 
1.6 was selected as a background galaxy. Further colour cuts 
where imposed to ensure completeness of the sample. The 
range of magnitudes was restricted to 20 < / < 24 and the 
V^-band limited to V < 28. Finally we also cut at V—I < 3.5, 
where the reddest galaxies cut off. 

Since the identification of cluster members is important 
to remove contamination of the background sample we also 
checked our colour selected candidates with new data from 
a photometric redshift survey of the same field (Dye et al, 
1998, in preparation). We found general agreement with the 
simpler colour selection. 

Having identified foreground and cluster members we 
produced a mask to eliminate those areas obscured by clus- 
ter members that would otherwise bias the mass estimate. 
To isolate the cluster members for the mask we selected all 
the galaxies in the colour-magnitude diagram lower than 

V — I — 1.6 and less than I = 22. This isolated most of 
the cluster sequence. The remaining galaxies in the region 

V - 1 < 1.6 and I > 22, V < 26.8, we identified as the faint 
blue background population. It is clear from figure ^ that 
the distinction between faint cluster member and faint blue 
background galaxy is rather vague. However, since the faint 
cluster members are also the smallest, the masked area is 
fairly insensitive to the exact division. Figure ^ shows the 
distribution of cluster galaxies and the red background pop- 
ulation. The concentric circles are centered on the peak in 
the cluster light distribution and show the position of the 
annuli used to calculate the radial profile in section 5.4. 




2 4 6 

X (arcmin) 



Figure 3. The masked region of A1689. Cluster members where 
selected using colour information (see text) and then masked over 
so that these regions do not affect the surface density estimate 
of background sources. The total region masked is about 10% of 
the area. The background galaxies are also shown as open circles. 
Superimposed are the concentric bins used to calculate the radial 
profile, centered on the peak in the light distribution. North is up 
and east is to the left. 

5.1.3 Selection by colour- 
Once the cluster galaxies have been isolated, the background 
galaxies may be sub-divided into a red and blue population, 
separated by V — I = 1.6. The observed slope of the lu- 
minosity function for these two populations for / > 20 is 
Pr = 0.38 and /3_g = 1 (Broadhurst 1995; we shall do a 
more accurate fit using our colour cuts in section 5.2.2). 
From equation ([lj we expect that the surface density of red 
galaxies will be suppressed due to the dilation effect, while 
magnification of the faint blue galaxy population will com- 
pensate for the dilation. Hence selecting by colour allows us 
to identify a population of galaxies with a very flat luminos- 
ity function to boost the lensing signal, at the expense of 
a reduction in galaxy numbers. Simple error analysis shows 
that the signal-to-noise varies as (Taylor & Dye 1998) 

S/N = 2\/3 - 1\kA(1 -k + 7'/k')V"(1 + na 2 )~ 1/2 , (20) 

where ' = d/dR. While the signal-to-noise is a linear func- 
tion of the slope of the luminosity function, it only grows 
with the square-root of the galaxy numbers, assuming Pois- 
son statistics. Hence one can get a better signal-to-noise by 
pre-selection of the red background population to boost the 
signal, at the expense of numbers. Equation ( po| ) also shows 
that one can get a better signal by observing to fainter mag- 
nitudes to simultaneously enhance the surface number den- 
sity and reduce the contribution from intrinsic clustering 
(see Taylor & Dye, 1998, for a more detailed discussion of 
observing strategies). 

There is also a practical reason for favouring the red 
galaxy population. While the cluster members are unlikely 
to be redder than the cluster E/SO sequence the distinc- 
tion between faint blue galaxies and cluster members, based 
on selection from the colour-magnitude diagram alone, is 
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somewhat vague. There may be blue cluster members that 
will contaminate the sample of blue background galaxies. 
In the absence of redshift information the blue background 
population is clearly harder to isolate. 

As we have noted the red population has relatively few 
faint counts, so that the expansion term in equation (1) dom- 
inates and there is a net underdensity of red galaxies behind 
the cluster (see Fig. ^ and Fig. Q). Conversely, faint blue 
galaxies are numerous and cancel the expansion effect. As 
expected we found the blue galaxies were uniform across the 
A1689 field. This is a good indicator that it is the magnifi- 
cation effect at work, and not some spurious contaminant, 
for example colour gradients across the field, or large scale 
variations due to clustering. In addition it also indicates that 
the deficit in the red population is not due to dust obscu- 
ration or reddening in the cluster, as this would affect both 
red and blue populations in equal measure. 




5.2 The distribution of background galaxies 

In Figure^ we show the surface distribution of the red popu- 
lation behind A1689, Gaussian smoothed on a scale of 0.35'. 
There are 268 background galaxies. The cluster members 
have been masked out and the masked areas interpolated 
over. The masked region contributes to only ~ 10% of the 
total field. Figure || shows the masked region. The cluster 
center, identified as the peak of the light distribution, is at 
(4.1', 3.6')- 

The angular size of the cluster scales like 



R(6) = 0.87Da(^ c )(6'/1')/i~ 1 Mpc, 



(21) 



where Da(z) = 2(1 — (1 + 2)~ 1/2 )/(l + z) is the comoving, 
dimensionless angular distance in an Einstein-de-Sitter uni- 
verse. Hence at the redshift of Abell 1689, z c = 0.183±0.001 
(Teague et al 1990), one arcminute is about 0.117/i Mpc. 

Figure ^| clearly shows a deficit of galaxies about the 
central peak in the light distribution at (4.1', 3.6'). At = 
0.75' there is an arc of very underdense number counts to 
the South-West of the cluster center, marked by a dashed 
line (The background is somewhat obscured by the cluster 
mask to the North-East of the cluster center). This is clear 
indication of a caustic feature in the background number 
counts, where the number density drops to zero due to di- 
lation. This exactly corresponds to the radius of the blue 
arcs observed by Tyson & Fisher (1995) at = 0.85' (see 
also the radial number counts in section 5.4). This is strong 
evidence that we have detected the magnification effect in 
the background counts. 



5.2.1 The redshift distribution of background galaxies 

The efficiency of lensing varies with the redshift of the back- 
ground source (BTP). Therefore it is important to esti- 
mate the background redshift distribution. Crampton et al 
(1995) find that Canada-France Redshift Survey (CFRS) 
has a median redshift of z = 0.56 for galaxies in the range 
17.5 < I < 22.5. They also show a colour-redshift diagram 
that indicates that the red galaxy population (V — I > 1.6) 
has a median redshift of about z ~ 0.8 (Crampton et 
al 1995, their figure 5). More accurately we can integrate 
the best fit Schechter function found by Lilly et al. (1995) 



Figure 4. The distribution of red I-band background sources for 
Abell 1689. Darker grey areas indicate an underdensity of source 
counts. The image is Gaussian smoothed with a smoothing scale 
of 0.35'. The peak of the light distribution is at (4.1', 3.6'). The 
maximum density of objects is 23.0 per square arcminute and the 
minimum is 1.1 per square arcminute. There are 15 contour lines 
spaced by An = 1.46 galaxies per square arcminute. A strong 
caustic feature is seen 0.75' from the peak (inner dashed line), 
more visible to the south— west, as the other side of the peak is 
masked over. A second feature is found in the radial profile at 
2.2' (outer dotted line). The image is orientated with East to the 
left and North to the top. 



for the CFRS red galaxy population. This has parameters 
<t>* = 0.0031 ± 0.00095, a = 1.03 ± 0.15 and M*{B) = -21, 
where M(B) = / - 5 log 10 (D L /10pc) + 2.51og 10 (l + z)+k- 
correction, where the k-correction is discussed in their paper 
and Dl = (1 + z)r(z) is the luminosity distance. Lilly et al. 
found no detectable evolution of the luminosity function of 
the CFRS red population and we assume no evolution. Ex- 
trapolating to the magnitude range 20 < / < 24 we find that 
the redshift distribution can be well fitted by the function 



az 



2 = 



■exp(-(z/z«) a ) 



(22) 



z?r[3/a] 

with a = 1.8 and z* = 0.78 to about 5% accuracy over 
a redshift range of 0.25 < z < 1.5. The moments of this 
distribution are 



[Z > = z. 



r[(3 + n)/a] 
r[3/a] 



(23) 



Hence for the red galaxy population we find that (2) = 0.96 
and <7 Z = 0.42. To simplify the analysis of the lensing proper- 
ties of the cluster, we shall assume hereafter that the back- 
ground distribution is at a single redshift of 2 = 0.8 (the 
mode of the distribution) and has an uncertainty of Sz = 0.4. 

As the caustic indicated by the blue arcs coincides with 
the magnification caustic, we can presume that the galaxy 
forming the arcs lies at the same redshift as the magnified 
red background galaxies, 2 « 0.8. At present we do not know 
the redshift of this arc. 
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that log 10 n(blue) w 0.35/ — 3.49, resulting in (3b = 0.; 



I 

Figure 5. Magnitude distribution of all I-band galaxies (solid 
dots), the red selected galaxies (grey dots) and the blue back- 
ground galaxies (open dots). The lines are the best fits to the 
data. 



5.2.2 Number counts of the background galaxy population 

Of major importance to the lens magnification method is 
the normalisation of the background galaxy population. The 
CFRS is not adequate for this, since their colour cuts were 
in the rest frame U — V, rather than the observed V — I. 
Instead we have used the Keck data of Smail et al (1995), 
who observed deep VRI images down to a limiting magni- 
tude of R w 27. The total differential galaxy count rate in 
the I-band can be approximated by 



log'i 



(0.271 ±0.009)/ - 1.45 



(24) 



over the range 20 < / < 24, where n is per magnitude 
per square degree. We have applied our colour criteria (see 
section 5.1) to the Keck data and find that the red galaxy 
population V — / > 1.6, can be well approximated by 



log 10 n(red) = (0.0864 ± 0.0187)/ + (2.12 ± 0.41) 



(25) 



over the range 20 < / < 24. Figure q shows the mag- 
nitude distribution for the full dataset and for the red- 
selected galaxy population and the best-fit lines. Integrat- 
ing the fit for the red galaxies yields a total count rate of 
n = 12.02 ± 3.37 galaxies per square arcminute in the range 
20 < / < 24. Since (3 = 2.5 d log 10 n/dm, we find that the 
Keck data implies /3r — 0.216 ± 0.047. This is the value of 
(5 we shall use in the subsequent analysis. 

An alternative, although less exact, method of normal- 
isation is to assume negligible cluster mass at the edge of 
the field and normalise the cluster to this. In general this 
would put a lower limit on the mass, and is similar to the 
method used to normalise shear mass maps. In fact if we do 
this for A1689 we find a background count rate very sim- 
ilar to that given by the Keck data. The error introduced 
into the final mass estimate by uncertainties in /3 scale as 
8n/n w (5/3/1 1 — P\, which for the Keck data results in a 
fractional error of around 5%. 

We have also fitted the blue counts in the Keck sample 
(Figure |B|). Over the same range as the red counts we find 



close to the lens invariant j3 — 
tween 23 < / < 24 of n (blue) 
arcminute. 



and a count density be- 
15.5 galaxies per square 



5.2.3 Clustering properties of the background population 

The amplitude of clustering of I-band galaxies and its de- 
pendence on redshift can be estimated from the CFRS (Le 
Fevre et al. 1996). Le Fevre (1996) find that there is little 
difference between the clustering properties of red and blue 
populations of galaxies for z > 0.5, implying that the pop- 
ulations were well mixed at this epoch. We therefore apply 
their clustering results directly to our red galaxy population. 
They fit their results to a power-law model for the evolving 
correlation function, £(r) = (r/ro) -7 , where 



ro(z) =r (0)(l + ^r (3+£)/7 



(26) 



where e = 1 ± 1 and r {z = 0.53) = 1.33 ± 0.09/i _1 Mpc and 
7 = 1.64 ± 0.05 is in this section the slope of the correlation 
function. 

The quantity we require is the variance in a given area 
of sky, which can be estimated by averaging the observed an- 
ular correlation function, u}(9), over a given area (equation 
The clustering variance for I-band galaxies then scales 
roughly like (Taylor & Dye 1998) 



= 10" 



\9/l') 



(27) 



where the sampled area is a circle of radius 9 and we have 
assumed unbiased, linear evolution of the density field. The 
background galaxies are assumed to all lie at z w 1. 

5.3 Reconstructing the surface mass density 

In Figure ^ we plot the reconstructed surface mass density 
of Abell 1689 using the nonlinear local sheet approximation, 
Ko (see section 4.1), changing parity on the caustic line at 
9 = 0.75' (see Figure ^). The uncertainty on the peak of 
the mass distribution is somewhat large (see Section 5.2), 
but significant features can be seen around the cluster core. 
There appears to be an extension to the south-west not seen 
in the cluster galaxy distribution. Interestingly there also ap- 
pears to be a loosely connected ridge, about 2.4' from the 
peak. We shall discuss this feature further below, but note 
that the shear mass map derived by Kaiser (1996; figure 2) 
shows similar extensions and ridge, although the extension 
to the west is not apparent in the shear map. Two under- 
dense regions are also seen to the south and to the east in 
both maps. While the comparison is only qualitative and the 
maps are noisy, we find this very encouraging as these maps 
are derived from completely independent methods, although 
the underlying data set is the same. 

5.4 The Mass Profile of Abell 1689 

While the mass maps are suggestive, a more quantitative 
measure can be made by angle averaging the counts and cal- 
culating the mass profile. Figure [j] shows the radial counts 
about the peak in the light distribution, normalised to the 
Keck data. The plotted error bars are only due to Pois- 
son statistics, although in the mass analysis below we shall 
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Figure 6. Reconstruction of the surface mass density of Abell 
1689 from the red background galaxy population, using the non- 
linear local sheet approximation and a full likelihood analysis in 
2-D. Light regions are high density. Only one caustic line is as- 
sumed, at 9 = 0.75' from the peak of the light distribution. The 
maximum surface density is k = 1.35, at (4.02', 3.41'), consistent 
with the peak in the light distribution. The minimum surface 
mass density is n = —0.47. There are 15 linearly spaced contours, 
separated by Ak = 0.12 and the map is Gaussian smoothed with 
a smoothing length of 9g = 0.35'. North is up and east is to the 
left. 
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B (arcmin) 

Figure 7. Radial profile of red counts behind Abell 1689. The 
background count density is no = 12 objects/arcmin 2 . Superim- 
posed is the profile for an isothermal model, normalised at the 
caustic radius, 9 = 0.75' (dashed line). 

take into account the effects of clustering. A general trend 
is clear, and lies close to the prediction for an isothermal 
lens normalised to the blue arc caustic. This has a surface 
mass density of k — O.375(0/l') - , corresponding to a virial 
velocity of 1600 kms -1 . Again it is worth emphasizing that 
the zero of the number counts at 9 = 0.75' corresponds to 
the caustic inferred from the blue arcs. The second dip will 
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Table 1. Table of angular radius (r in arcminutes), number of red 
galaxies (N), ratio of galaxies to background (N/No), the total 
area of the annuli (areas are in arcmin 2 ) and area obscured by 
the mask. The unobscured area is total area — obscured area. The 
expected number of galaxies in an annuli is -/Vo = no X unobscured 
area 

be discussed in more detail in section 5.4.4. The increase in 
counts at 6 = 3.7' is likely to be due to a clustering effect. 
W I I 
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Figure 8. Radial profile of blue counts behind Abell 1689. The 
background count density is no = 22 objects/arcmin 2 . As ex- 
pected from the nearly lens invariant slope /3bi ue = 0.88, the 
number counts are nearly fiat, and at large radii tend towards 
n/no = 1. The slight increase towards the cluster center is prob- 
ably due to contamination of the counts by blue cluster members 

In Figure ^ we show that radial profile for the blue 
galaxy population, normalised with the Keck data in sec- 
tion 5.2.2. As expected there is no lensing signal. The slight 
increase towards the cluster center is due to contamination 
from the blue cluster members. 

5.4-1 Local approximations for the surface mass density 

Fig. |^ shows the radial mass profile of the cluster Abell 1689 
assuming a single caustic at = 0.75'. The two solid lines 
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Figure 9. Radial profile of surface mass density of cluster Abell 
1689. The dark solid region shows the uncertainty due to the 
strong lensing estimators. The lighter shaded region is due to the 
clustering and shot-noise uncertainty of the background popula- 
tion. The solid line is a singular isothermal profile, normalised to 
the caustic feature at 8 = 0.75'. 




(arcmin) 

Figure 10. Radial profiles of surface mass density, ft, for A1689 
(solid line with dots), calculated by solving the axially symmetric 
lens equation jl9|). The shaded regions are la errors calculated via 
error propagation from the uncertainty on the measured amplifi- 
cation profile. The solid dark line is a singular isothermal profile 
normalised to the caustic feature at 8 = 0.75'. The lighter solid 
line is the local best fit estimator, k c 



are calculated using the Logriormal-Poisson likelihood esti- 
mator (equation ^) with each of the two strong lensing ap- 
proximations (equations ^ and ^| ) . The light shaded region 
indicates the la uncertainty due to both shot noise and the 
effects of clustering. The dark shaded region indicates the 
region between the two extreme estimators. Away from the 
cluster center these agree and are equal to the weak lensing 
estimator, but noise effects become dominant. Closer to the 
cluster center the uncertainty due to the shear increases and 
becomes dominant at 6 < 1'. However the cluster mass pro- 
file is significantly detected between 1' < 8 < 2.6'. We also 
appear to see a deviation from an isothermal profile, which 
is also plotted. When the procedure was repeated with the 
center of the annuli offset from the peak of the light distribu- 
tion the mass profile was weaker and less significant, as one 
would expect if the peak of the mass density was associated 
with that of light. 



5.4-2 Nonlocal approximation for the mass density and 
shear 

In Figures and[y] we assume axisymmetry and equation 
(^) to calculate the surface mass-density and shear simul- 
taneously. We set 71 = 0.3 for the first shell. The resulting 
profile is fairly insensitive to this choice, only affecting the 
first 2 shells. The uncertainty on the shear in the first shell 
is small because this must be chosen a priori. However av- 
eraging over shells means that the errors do not strongly 
propagate through to higher radii. Again a mass detection 
is found between 1' and 2.8', this time with the shear ac- 
counted for. In this region k w 0.4T0.15, which is somewhat 
higher than that found by the shear estimate of n — 0.2±0.1 
(Kaiser 1996; note that we quote Kaiser's colour selected 
sample, where cluster members that may contaminate the 
shear estimate have been removed. This corresponds to com- 




(arcmin) 

Figure 11. Radial profiles of tangential shear, 7t, for A1689 (solid 
line with dots), calculated by solving the axially symmetric lens 
equation (p^|). The shaded regions are la errors calculated via 
error propagation from the uncertainty on the measured amplifi- 
cation profile. The solid dark line is a singular isothermal profile 
normalised to the caustic feature at 8 = 0.75'. The lighter solid 
line is the local best fit estimator, k c . 
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bining our red and blue background populations. This will 
change the redshift distribution of the background and in- 
clude some residual blue cluster contamination which may 
account for the discrepancy). Also, for the single caustic so- 
lution, we see a large spike at 2.2', which is not seen in the 
Kaiser (1996) results. However the shear method correlates 
points, which may lead to both the suppression of features, 
and underestimation of the errors. 

Our estimate of the shear field is far more uncertain, 
with 74 = 0.2 ± 0.3 over most of the range. There is a slight 
increase beyond 2.4' due to the spike in the surface mass pro- 
file at that radius, but the profile is dominated by noise. This 
increase is not reflected in the angle averaged measurements 
of Kaiser (1996), where the mean shear is 7 = 0.15 ± 0.05. 



5.4-3 Local approximation for the surface mass density 
and shear 

Figures [ji] and [H] also show k and 7 estimated from the 
best-fit solution of section 4.1. We find good agreement be- 
tween the local and nonlocal approximations for k, but the 
the shear profiles are somewhat different, reflecting that one 
estimator is local and one nonlocal. However, the large un- 
certainties produced by each estimator means that we can- 
not predict the shear profile with much certainty from the 
available data. 



5.4-4 Two background populations ? 

An interesting feature of the counts in Figure (?] is the ap- 
pearance of two pronounced dips, one at 0.75' and another 
one at 2.2'. While the inner dip has already been identified 
with a caustic line, the outer dip is somewhat anomalous. A 
number of possibilities could account for this. The feature 
was noted in the mass plot as a low signal-to-noise ridge in 
the density and can be seen in the number counts as a loosely 
connected ring about the cluster center. One possibility is 
that this is due to clustering in the background population, 
combined with a large mass concentration to the south-east 
of the peak in the light distribution. There are few cluster 
members in the region of the ridge, or the bump, so the 
effect is not due to masking. 

An alternative is that this is the first glimpse of a second 
caustic line. In principle a second caustic can be created by 
placing the background galaxies at two redshifts, one at low 
redshift, one at high redshift (eg, Fort, Mellier & Dantel-Fort 
1996). The observed number counts would then be given by 



8-1 
2 



(28) 



where Ai = A(fi) with ft — K(zi)/n 



\ZT+zE)/(vT+Zi - 1) (BTP), and i = 1,2 for the two 
galaxy populations, v is the fraction of galaxies at redshift 
22. An outer caustic line must be produced by the high red- 
shift population. If we make this population lie at z = 0.8, 
then the low redshift population must lie at z — 0.3. Both 
populations are reflecting the same arc, the difference in 
projected radii is wholly due to their relative redshifts. 

However this would double the predicted mass from lens 
magnification, making Abell 1689 a very extreme cluster. 
In addition it seems hard to make a caustic line from the 
high redshift population for such a massive cluster without 



forming a second, inner radial caustic. As the strongest arc 
is tangential and is seen near the inner arc, one would have 
to conspire to have a nearby galaxy, at z — 0.3, lensed and 
lying at the same projected radii as the radial arc produced 
by the high redshift population. This seems highly unlikely. 

One could also keep the mass roughly constant and 
place a second population at z > 0.8. This is a possibil- 
ity, but does not strongly affect our mass estimate assuming 
a single caustic solution. In the absence of further evidence 
for a second high redshift population, we shall only consider 
the single caustic model. 



5.5 Mass estimate of Abell 1689 

5.5.1 From K to mass surface density 

Assuming that the background galaxies all lie at the same 
redshift of * = 0.8, and given that the surface density scales 
as 



E = is (VTTz- - + z L ) 2 

3 (Vr+^I- lXv/r+i-v/r+^Z)' 



(29) 



where Eo = 8.32 x lO 14 /iM0Mpc~ 2 is the mean mass per 
unit area in the universe, then we find that the surface mass 
density is 

E = 5.9 x 10 15 k [hM Q Mpc~ 2 ]. (30) 

Although we have assumed an Einstein-de-Sitter universe, 
these results only depend weakly on cosmology (BTP). 



5.5.2 Uncertainty in the redshift distribution 

The error introduced by assuming the background galaxies 
lie at the same redshift can be estimated by error propa- 
gation and assuming Sz = 0.4 (see section 5.2.1). Hence 
<5E = \d'E/dz\Sz and the fractional uncertainty on the sur- 
face mass density due to the uncertainty in redshift distribu- 
tion of the background galaxies is <5E/E = 0.3752 = 0.148. 
The same error is also found in mass estimates based on the 
shear pattern. 



5.5.3 Uncertainty due to normalisation of background 
counts 

Assuming a sheet mass solution (fto in section 4.1) we find 
that the uncertainty due to the normalisation of the back- 
ground counts is Sn — (|1 — k|/2|/3 — l[)<Sno/no. For A1689 
and the red galaxy population this is Sk = 0.15|1 — k\. For 
an average n = 0.5 the uncertainty is around 8k = 0.07. 



5.5.4 The cumulative mass distribution 

Figure ^ shows the cumulative mass interior to a shell, cal- 
culated from both the nonlocal approximation (section 4.2), 
and the best-fit local approximation allowing only a single 
caustic solution (section 4.1). The uncertainties are treated 
by error propagation. We find that the 2d projected mass 
interior to 0.24/i~ 1 Mpc is 

M 2d (< 0.24/i _1 Mpc) = (0.50 ± 0.09) x 10 15 /i _1 M Q , (31) 
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Figure 12. Cumulative mass profile of Abell 1689. The solid 
dark line and shaded uncertainties are estimated using the axi- 
symmctric nonlocal estimator described in section 4.2. The lighter 
grey line is the cumulative mass estimated from the best-fit local 
approximation, k c described in section 4.1. Also plotted is the 
isothermal fit to the blue arc caustic (dotted line), similar to the 
shear results of Kaiser (1996) and Tyson & Fischer (1995). 

and that the two estimators are in good agreement. We find 
that the projected mass scales like 

M 2d (< R) « 3.5 x lO 15 (i?//i" 1 Mpc) 1 ' 3 /i- 1 M , (32) 

for R < 0.32/i _1 Mpc, similar to that for an isothermal 
sphere; M ~ R. Hence it appears that A1689 has a near- 
isothermal core. Beyond R = 0.32/i~ Mpc the lensing signal 
is lost in background noise, and we can only say that k < 0.1 
Including the uncertainty from the background redshift 
distribution and the normalisation of background counts in- 
creases the error to about 30%. 



6 COMPARISON WITH OTHER MASS 

ESTIMATES OF A1689 AND INFERRING 
THE 3D MASS DISTRIBUTION 

In this section we compare the mass derived from lens mag- 
nification with that found from a number of other, indepen- 
dent measurements. Firstly we compare our results with esti- 
mates of the mass based on the shear pattern found around 
A1689 (section 6.1). The magnification and shear comple- 
ment each other in that the shear pattern has a higher 
signal-to-noise, since it is not affected by clustering noise 
(although with redshift information the magnification can 
also be measured free from clustering noise - see BTP), but 
suffers from the "sheet-mass" degeneracy. We shall combine 
the magnification and shear pattern elsewhere. 

While the lens magnification mass is vital for fixing the 
total 2d projected mass distribution independently of any 
assumptions about the dynamical state of the cluster, much 
information can be gained by combining this with other 
mass estimates, assuming that these are not strongly bi- 
ased by their reliance on thermodynamical equilibrium. In 
this section we describe a method for transforming from the 



2d lens mass to other cluster characteristics, such as the 
line-of-sight velocity dispersion (section 6.3) and the X-ray 
temperature (section 6.4). Discrepancies that arise between 
these predicted characteristics and the actual measurements 
can be used to infer information about the mass distribution 
along the line of sight (Bartelmann & Kolatt, 1997). We find 
that while there is fair agreement between all of the mass 
estimates when projection effects are taken into account, the 
agreement is better if the cluster A1689 is composed of two 
clusters superimposed along the line of sight and separated 
by about Az = 0.02. 

The transformation from a 2-d projected lensing mass 
to a 3-d mass, line-of-sight velocity dispersion and X-ray 
temperature can be made using either the isothermal model, 
or by using relations found in N-body simulations of clusters. 
While the former provides a simpler method, one has more 
freedom with simulations to include or exclude the various 
projection effects that contaminate measurements of these 
quantities. In this section we shall use the relations found 
by van Kampen (in preparation) from an ensemble of CDM 
cluster simulations, all with Qo = 1 and as = 0.54. These 
relations are model dependent, but serve to aid comparison 
between the various mass measurements. We have also pro- 
vided a table of quantities (Table 1) where the uncertainties 
have been calculated by combining the error on the cluster 
mass with the dispersion found in the deprojection relations. 

We begin by comparing the lens magnification mass 
with the mass determined from the shear field around 
A1689. 

6.1 Comparison with arclets and weak shear 

Tyson & Fischer (1995) provide mass profiles of A1689 from 
arclets, another independent estimator of the mass, normal- 
ized to the caustic line indicated by the blue arcs. They find 
that the 2d projected mass within R — 0.1/i -1 Mpc is 

M 2d (< 0.1) = (0.18 ± 0.01) x 1O 15 /i _1 M . (33) 

They also find that the mass scales like an isothermal sphere 
out to 0.4/i -1 Mpc, before turning over to an R~ 14 profile. 
This implies that in the regime we probe with the magnifi- 
cation the cumulative mass scales like 

M(< R) = (1.8 ± 0.1) x W 1B (R/h~ 1 Mpc)h~ 1 Mt 3 . (34) 

This is very close to the profile we find from lens magnifica- 
tion (equation |3^). Using this we scale their results giving 

M 2d {< 0.24) = (0.43 ± 0.02) x 1O 15 /i _1 M , (35) 

in good agreement with the mass from magnification. 

Kaiser (1996) has also calculated k based on the weak 
shear method (Kaiser & Squires, 1993), using the same data 
we have used here for A1689. We noted above that there 
are qualitative similarities between the weak shear maps 
and those presented by Kaiser, which is significant since the 
methods are independent. The mass-density profile found 
from the shear pattern is also well fitted by an isothermal 
profile; 

M 2d (< R) = 1.8 x lO 15 (i?/ft~ 1 Mpc)/ ! ,- 1 M , (36) 

with a 10% statistical uncertainty and further 10% system- 
atic error due to the uncertainty in the redshift distribution 
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(section 5.5.2). Compared with our 2-d mass Kaiser's anal- 
ysis suggests that 

M 2d (< 0.28) = (0.43 ± 0.04) x lO 15 /i -1 M , (37) 

again in good agreement with that found by the magnifica- 
tion method. 

6.2 The 3d mass estimated from lensing alone 

The 3-d mass inferred from the 2-d projected mass inside a 
sphere of radius r — 0.5/i -1 Mpc is 

M 3d (< 0.5) = (0.72 ± 0.25) x lO 15 /i -1 M , (38) 

while the mass inside an Abell radius, r = 1.5/i -1 Mpc, is 

M 3d (< 1.5) = (1.6 ± 0.6) x 1O 15 /i -1 M . (39) 

These estimates are probably an overestimate of the true 
3d mass since the dispersion in the simulations includes the 
effect of the alignment of the clusters' principle axis along 
the line of sight. Given that the inferred 3d mass is so high 
A1689 is probably lying at the extreme of such a distribu- 
tion. In such cases the 3d mass may be much lower than mass 
inferred from a 2d projection. We discuss this possibility in 
the next few sections. 

6.3 Velocity dispersion of Abell 1689 

The predicted line-of-sight velocity dispersion estimated 
from the simulations also includes the effects of superposi- 
tion of clusters, infall along filaments and interlopers and so 
predict larger velocities than isolated clusters would. Given 
this we find that the observed line-of-sight velocity disper- 
sion inferred from the 2d lensing mass is 

a v (< 1.5/i -1 Mpc) = 3400 ± 900 kms -1 (40) 

for A1689. Again this is very much on the high side for a 
cluster in a typical CDM universe, and much higher than 
observational data suggests. 

The velocity dispersion measured in a similar manner 
by Teague et al (1990) is 2355±f|| kms" 1 , nearly la lower 
than our estimate. Hence the predicted velocity is not too 
different from the measured value, so long as both are mea- 
sured in the same way. The large mass implied by lensing 
suggests that A1689 is not a single cluster, but a superposi- 
tion of two smaller clumps. Miralde-Escude & Babul (1995) 
have modeled A1689 by two adjacent clumps with velocity 
dispersions 1450 kms -1 and 700 kms - , suggesting that one 
clump has a mass 4 times that of the other. Taken together 
and including the relative velocity of the merging clumps, 
this accounts for the larger velocities measured earlier, den 
Hartog & Katgert (1996) have tried to take into account 
interlopers and find that v = 1861 kms -1 . 

If, like Miralde-Escude & Babul, we assume a double 
cluster model but place one cluster at z = 0.18 with a ve- 
locity dispersion of 1500 kms -1 and the second at z = 0.20 
with a velocity dispersion of 750 kms - , we find that we can 
reproduce a total projected velocity dispersion of around 
2300 kms -1 . Figures 4 and 5 of Teague et al (1990) also pro- 
vide marginal evidence for a second concentration of galaxies 
at z = 0.2. Hence it seems plausible that the high lensing 
mass of A1689 can be explained by a superposition of clus- 
ters. 



Quantity This work Other 

M 2d (< 0.24) 0.50 ± 0.09 0.43 ± 0.02 (Tyson & Fischer) 

0.43 ± 0.04 (Kaiser) 

M 3d (<0.5) 0.72 ±0.25 

M 500 1.6 ±0.65 0.94 ±0.16 (Yamashita) 

a v (< 1.5) 3400 ±900 23551^3 (Teague et al) 



Table 2. Mass estimates for A1689 based on lens magnification 
(second column) and from other measurements (third column). 
Masses are given in units of 10 °h~ Mq and velocities are quoted 
in units of kms -1 . Distance are given in /i -1 Mpc. The other mea- 
surements are based on arclets (Tyson & Fischer 1995), the shear 
pattern (Kaiser 1996), X-ray temperatures (Yamashita 1994) and 
line-of-sight velocity dispersion (Teague et al 1990). Also given 
are the 3d mass estimates from lens magnification. 

6.4 X-ray mass estimates of Abell 1689 

Evrard, Metzler & Navarro (1996) have found that the mass 
within the radius defined where the mean cluster density is 
500 times the critical density is strongly correlated with the 
cluster temperature. They fit this relation from simulations 
by 

M 500 = 1.11 x 10 15 (y^) ^ ft _1 M , (41) 

where Tx is the broad-beam temperature and M500 is the 
3d mass within a radius defined by an overdensity 500p cr it • 
This radius is roughly given by rsoo = 1.175/i -1 Mpc. 

X-ray temperatures of A1689 have been measured by 
both GINGA and ACSA. Yamashita (1994) has analysed 
this data and finds T — 9 ± IkeV while Mushotzky and 
Scharf (1997) find T = 9.02j£jkeV. Daines et al (1997) 
have also recently re-analyzed ROSAT PSPC observations 
and find a mean temperature of Tx = 10.2±4keV. Note that 
we are quoting the mean temperature, and incorporated the 
40% uncertainty in the error estimate, rather than quot- 
ing upper limits as Daines et al do. The major uncertainty 
in measuring X-ray temperatures here is instrumental, as 
lOkeV is approaching the limit of ROSAT's sensitivity. 

Taking the result of Yamashita and the relation found 
by Evrard et al, we find that 

M500 = (0.95 ± 0.16) x 10 1B /i -1 M o . (42) 

Using the simulated scaling relations we find 

M500 = (1-6 ± 0.65) x lO 1B /i -1 M (43) 

for Abell 1689, implying an X-ray temperature of Tx = 
12.7 ± 3.4keV, within the la uncertainty of the measured 
X-ray temperature. Again, if we consider A1689 as a double 
cluster, the nearer, larger mass concentration would be de- 
tected in X-ray, lowering the expected X-ray temperature. 
From the velocity dispersions we can infer a temperature 
nearer to Tx = 0.7keV, slightly below, but again in agree- 
ment with observations. 

In conclusion, although we find a high mass, there is a 
general consistency between the mass of A1689 estimated 
from lens magnification and shear. In addition we find a 
fair agreement between the lens mass and the line of sight 
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velocity dispersion if we take into account projection effects. 
Modelling A1689 as a double cluster we find that the velocity 
dispersion can be much lower, implying two smaller clusters, 
with the lensing mass a superposition of cluster masses. This 
hypothesis might also help explain the marginal discrepancy 
with X-ray temperature. 

However, if A1689 is a double cluster, one would expect 
that the measured velocity dispersion would be higher than 
that inferred by the lensing mass, due to the cluster sepa- 
ration, while the X-ray mass is lower, since only the more 
massive cluster dominates the X-ray emission. Our results 
indicate that Abell 1689 has a larger lensing mass than that 
implied by both velocity dispersion and X-ray temperature, 
although given the large uncertainties it is hard to be con- 
clusive. 

Finally, A1689 is in projection a highly spherical clus- 
ter, in contrast with the majority of clusters which appear 
extended. While this may be be a result of its high mass, it is 
also possible that A1689 has its major axis aligned along the 
line of sight, pointing towards a second cluster. While much 
of the evidence on the mass distribution along the line-of- 
sight is circumstantial, all of these effects would conspire to 
give A1689 its impressively massive appearance. 



7 DISCUSSION 

The absolute surface mass density of a galaxy cluster can 
be estimated from the magnification effect on a background 
population of galaxies, breaking the "sheet-mass" degener- 
acy. To apply this in practice, we have taken into account 
the nonlinear clustering of the background population and 
shot-noise, both of which contribute to uncertainties in the 
lensing signal. A further complication is the contribution 
of shear to the magnification in the strong lensing regime, 
where the magnification signal is stronger. We have argued 
that this can be circumvented by approximate methods that 
can be either local, where a relationship between surface 
mass and shear is assumed, or by a nonlocal approxima- 
tion where only the shape of the cluster is assumed. Both 
approximations seem to work well on simulated data. 

We have applied these methods to the lensing cluster 
Abell 1689, using Keck data of Smail et al (1995) to nor- 
malise the background counts, and the CFRS results to in- 
fer the redshift distribution and clustering properties of our 
data. Using a 7 = approximation of the surface density in 
the strong lensing regime, we have reconstructed a 2-d mass 
map for A 1689 in the innermost 27 square arcminutes, where 
a substantial part of the lensing signal comes from. The 2d 
map has general features similar to those seen from shear 
maps (Kaiser 1996). This is encouraging for both methods, 
as they are independent determinations of the mass distri- 
bution. 

For a more quantitative measure we have binned the 
data in annuli around the peak in the light distribution and 
found a significant (5a) drop in the number counts, drop- 
ping to zero where a caustic is inferred from arcs. Local and 
nonlocal approximations were used to find the ft profile from 
the number counts, and estimate the shear field. We found 
these to be quantitatively similar to that found by the shear 
method. 

We have also discussed the possibility of a second pop- 



ulation of background galaxies, creating a second dip in the 
radial number counts and a spike in the mass profile. How- 
ever we argued that is is unlikely that there is a second low-z 
population as the cluster mass would be improbably high, 
and if there is a high-z population it has little effect on our 
results. 

We have calculated a cumulative mass profile for A1689 
and find a projected 2-d cumulative mass of 

M 2d {< 0.24/i _1 Mpc) = (0.50 ± 0.09) x 1O 15 /i _1 M . (44) 

Such a large mass is very rare in a CDM universe normalised 
to the observed cluster abundance, and may indicate that 
A1689 is composed of two large masses along the line of 
sight, and/or filaments connected to the cluster and aligned 
along the line-of-sight. This is also implied by the high 
line of sight velocity dispersion which would be enhanced 
by merging clusters (Miralde-Escude & Babul 1995) or by 
infall from aligned filaments. 

We have compared our mass estimates with other esti- 
mates available in the literature and find that the lens mag- 
nification, shear, arclets, line-of-sight velocity dispersions 
and the X-ray temperature mass estimates are all in reason- 
able agreement, to within the uncertainties at this time. 

The results presented here are from 3 hrs integration on 
the 3.6m NTT. Longer integration times have the combined 
benefit of increasing the number of background galaxies, and 
so reducing shot noise, and of reducing the contribution from 
cosmic variance (equation |2^, Section 5.1.3). Hence by in- 
creasing the exposure time we can expect to reduce the un- 
certainty from lens magnification by a factor of 2 or so. 

One drawback of this analysis is the contribution of 
clustering noise to the background counts. This can be re- 
moved using redshift information, either from spectroscopy, 
or more efficiently using photometric redshift information 
(BTP). We shall explore this elsewhere (Dye et al, in prepa- 
ration). 

If our results are extended to other clusters we can hope 
to have a good representation of the total mass distribution, 
gas and galaxy contents with which to make strong statisti- 
cal arguments about the matter content of the largest grav- 
itationally collapsed structures in the universe. 
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